<!DOCTYPE html>

<html>

<head>

<meta charset="utf-8" />
<meta name="generator" content="pandoc" />
<meta http-equiv="X-UA-Compatible" content="IE=EDGE" />

<meta name="viewport" content="width=device-width, initial-scale=1" />

<meta name="author" content="Massimo Aria" />

<meta name="date" content="2020-09-28" />

<title>Bibliometrix: Data Importing and Converting</title>



<style type="text/css">code{white-space: pre;}</style>
<style type="text/css" data-origin="pandoc">
code.sourceCode > span { display: inline-block; line-height: 1.25; }
code.sourceCode > span { color: inherit; text-decoration: inherit; }
code.sourceCode > span:empty { height: 1.2em; }
.sourceCode { overflow: visible; }
code.sourceCode { white-space: pre; position: relative; }
div.sourceCode { margin: 1em 0; }
pre.sourceCode { margin: 0; }
@media screen {
div.sourceCode { overflow: auto; }
}
@media print {
code.sourceCode { white-space: pre-wrap; }
code.sourceCode > span { text-indent: -5em; padding-left: 5em; }
}
pre.numberSource code
  { counter-reset: source-line 0; }
pre.numberSource code > span
  { position: relative; left: -4em; counter-increment: source-line; }
pre.numberSource code > span > a:first-child::before
  { content: counter(source-line);
    position: relative; left: -1em; text-align: right; vertical-align: baseline;
    border: none; display: inline-block;
    -webkit-touch-callout: none; -webkit-user-select: none;
    -khtml-user-select: none; -moz-user-select: none;
    -ms-user-select: none; user-select: none;
    padding: 0 4px; width: 4em;
    color: #aaaaaa;
  }
pre.numberSource { margin-left: 3em; border-left: 1px solid #aaaaaa;  padding-left: 4px; }
div.sourceCode
  {   }
@media screen {
code.sourceCode > span > a:first-child::before { text-decoration: underline; }
}
code span.al { color: #ff0000; font-weight: bold; } /* Alert */
code span.an { color: #60a0b0; font-weight: bold; font-style: italic; } /* Annotation */
code span.at { color: #7d9029; } /* Attribute */
code span.bn { color: #40a070; } /* BaseN */
code span.bu { } /* BuiltIn */
code span.cf { color: #007020; font-weight: bold; } /* ControlFlow */
code span.ch { color: #4070a0; } /* Char */
code span.cn { color: #880000; } /* Constant */
code span.co { color: #60a0b0; font-style: italic; } /* Comment */
code span.cv { color: #60a0b0; font-weight: bold; font-style: italic; } /* CommentVar */
code span.do { color: #ba2121; font-style: italic; } /* Documentation */
code span.dt { color: #902000; } /* DataType */
code span.dv { color: #40a070; } /* DecVal */
code span.er { color: #ff0000; font-weight: bold; } /* Error */
code span.ex { } /* Extension */
code span.fl { color: #40a070; } /* Float */
code span.fu { color: #06287e; } /* Function */
code span.im { } /* Import */
code span.in { color: #60a0b0; font-weight: bold; font-style: italic; } /* Information */
code span.kw { color: #007020; font-weight: bold; } /* Keyword */
code span.op { color: #666666; } /* Operator */
code span.ot { color: #007020; } /* Other */
code span.pp { color: #bc7a00; } /* Preprocessor */
code span.sc { color: #4070a0; } /* SpecialChar */
code span.ss { color: #bb6688; } /* SpecialString */
code span.st { color: #4070a0; } /* String */
code span.va { color: #19177c; } /* Variable */
code span.vs { color: #4070a0; } /* VerbatimString */
code span.wa { color: #60a0b0; font-weight: bold; font-style: italic; } /* Warning */

</style>
<script>
// apply pandoc div.sourceCode style to pre.sourceCode instead
(function() {
  var sheets = document.styleSheets;
  for (var i = 0; i < sheets.length; i++) {
    if (sheets[i].ownerNode.dataset["origin"] !== "pandoc") continue;
    try { var rules = sheets[i].cssRules; } catch (e) { continue; }
    for (var j = 0; j < rules.length; j++) {
      var rule = rules[j];
      // check if there is a div.sourceCode rule
      if (rule.type !== rule.STYLE_RULE || rule.selectorText !== "div.sourceCode") continue;
      var style = rule.style.cssText;
      // check if color or background-color is set
      if (rule.style.color === '' && rule.style.backgroundColor === '') continue;
      // replace div.sourceCode by a pre.sourceCode rule
      sheets[i].deleteRule(j);
      sheets[i].insertRule('pre.sourceCode{' + style + '}', j);
    }
  }
})();
</script>



<style type="text/css">body {
background-color: #fff;
margin: 1em auto;
max-width: 700px;
overflow: visible;
padding-left: 2em;
padding-right: 2em;
font-family: "Open Sans", "Helvetica Neue", Helvetica, Arial, sans-serif;
font-size: 14px;
line-height: 1.35;
}
#TOC {
clear: both;
margin: 0 0 10px 10px;
padding: 4px;
width: 400px;
border: 1px solid #CCCCCC;
border-radius: 5px;
background-color: #f6f6f6;
font-size: 13px;
line-height: 1.3;
}
#TOC .toctitle {
font-weight: bold;
font-size: 15px;
margin-left: 5px;
}
#TOC ul {
padding-left: 40px;
margin-left: -1.5em;
margin-top: 5px;
margin-bottom: 5px;
}
#TOC ul ul {
margin-left: -2em;
}
#TOC li {
line-height: 16px;
}
table {
margin: 1em auto;
border-width: 1px;
border-color: #DDDDDD;
border-style: outset;
border-collapse: collapse;
}
table th {
border-width: 2px;
padding: 5px;
border-style: inset;
}
table td {
border-width: 1px;
border-style: inset;
line-height: 18px;
padding: 5px 5px;
}
table, table th, table td {
border-left-style: none;
border-right-style: none;
}
table thead, table tr.even {
background-color: #f7f7f7;
}
p {
margin: 0.5em 0;
}
blockquote {
background-color: #f6f6f6;
padding: 0.25em 0.75em;
}
hr {
border-style: solid;
border: none;
border-top: 1px solid #777;
margin: 28px 0;
}
dl {
margin-left: 0;
}
dl dd {
margin-bottom: 13px;
margin-left: 13px;
}
dl dt {
font-weight: bold;
}
ul {
margin-top: 0;
}
ul li {
list-style: circle outside;
}
ul ul {
margin-bottom: 0;
}
pre, code {
background-color: #f7f7f7;
border-radius: 3px;
color: #333;
white-space: pre-wrap; 
}
pre {
border-radius: 3px;
margin: 5px 0px 10px 0px;
padding: 10px;
}
pre:not([class]) {
background-color: #f7f7f7;
}
code {
font-family: Consolas, Monaco, 'Courier New', monospace;
font-size: 85%;
}
p > code, li > code {
padding: 2px 0px;
}
div.figure {
text-align: center;
}
img {
background-color: #FFFFFF;
padding: 2px;
border: 1px solid #DDDDDD;
border-radius: 3px;
border: 1px solid #CCCCCC;
margin: 0 5px;
}
h1 {
margin-top: 0;
font-size: 35px;
line-height: 40px;
}
h2 {
border-bottom: 4px solid #f7f7f7;
padding-top: 10px;
padding-bottom: 2px;
font-size: 145%;
}
h3 {
border-bottom: 2px solid #f7f7f7;
padding-top: 10px;
font-size: 120%;
}
h4 {
border-bottom: 1px solid #f7f7f7;
margin-left: 8px;
font-size: 105%;
}
h5, h6 {
border-bottom: 1px solid #ccc;
font-size: 105%;
}
a {
color: #0033dd;
text-decoration: none;
}
a:hover {
color: #6666ff; }
a:visited {
color: #800080; }
a:visited:hover {
color: #BB00BB; }
a[href^="http:"] {
text-decoration: underline; }
a[href^="https:"] {
text-decoration: underline; }

code > span.kw { color: #555; font-weight: bold; } 
code > span.dt { color: #902000; } 
code > span.dv { color: #40a070; } 
code > span.bn { color: #d14; } 
code > span.fl { color: #d14; } 
code > span.ch { color: #d14; } 
code > span.st { color: #d14; } 
code > span.co { color: #888888; font-style: italic; } 
code > span.ot { color: #007020; } 
code > span.al { color: #ff0000; font-weight: bold; } 
code > span.fu { color: #900; font-weight: bold; } 
code > span.er { color: #a61717; background-color: #e3d2d2; } 
</style>




</head>

<body>




<h1 class="title toc-ignore">Bibliometrix: Data Importing and Converting</h1>
<h4 class="author">Massimo Aria</h4>
<h4 class="date">2020-09-28</h4>



<p> </p>
<p> </p>
<div id="a-common-workflow-to-search-and-export-bibliographic-documents" class="section level2">
<h2>A common workflow to search and export bibliographic documents</h2>
<p>All databases follow a common workflow to search and export bibliographic collections, mainly based on three steps: <em>1. Write and submit a query</em>, <em>2. Refine search results</em> and <em>3. Export search results</em>.</p>
<p> </p>
<div id="write-and-submit-a-query" class="section level3">
<h3><strong>1. Write and submit a query</strong><br />
</h3>
<p>A query is usually based on a set of terms linked by Boolean operators. The search engine queries the database to identify the records matching the query.<br />
For example, given TI = (bibliometric AND analysis), the search engine returns all records whose title contains both the words ‘bibliometric’ and ‘analysis’.</p>
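<p>As a rough illustration of what the AND operator does, the sketch below applies the same rule to a few made-up titles in base R. The titles and variable names are invented for the example; real databases use their own search engines and query syntax.</p>
<div class="sourceCode"><pre class="sourceCode r"><code class="sourceCode r"># Toy titles (invented); a real database searches its own index.
titles &lt;- c(&quot;A bibliometric analysis of management research&quot;,
            &quot;Bibliometric tools in R&quot;,
            &quot;Survival analysis in medicine&quot;)

# TI = (bibliometric AND analysis): both terms must appear in the title.
has_bibliometric &lt;- grepl(&quot;bibliometric&quot;, titles, ignore.case = TRUE)
has_analysis     &lt;- grepl(&quot;analysis&quot;, titles, ignore.case = TRUE)

# Keep only the titles that satisfy both conditions.
matches &lt;- titles[has_bibliometric &amp; has_analysis]
print(matches)</code></pre></div>
<p>Only the first title is kept, because it is the only one that contains both terms.</p>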
<p> </p>
</div>
<div id="refine-search-results" class="section level3">
<h3><strong>2. Refine search results</strong><br />
</h3>
<p>Search results can be refined by applying filtering criteria to additional fields.<br />
For example, selecting Document Type = ‘Journal Article’ AND Language = ‘English’ AND Timespan = 1990:2020 AND Subject Category = ‘Management’.</p>
<p> </p>
</div>
<div id="export-search-results" class="section level3">
<h3><strong>3. Export search results</strong><br />
</h3>
<p>In this step, the user must choose which metadata to download and the export file format in which to save the results.<br />
</p>
<blockquote>
<p>To work, <em>bibliometrix</em> requires a minimum set of mandatory metadata (e.g. authors’ names, title, journal title, affiliation, publication year, etc.).</p>
<p>Our advice is to always select all the metadata fields, to be sure you can perform all the analyses implemented in bibliometrix.</p>
</blockquote>
<p>Many databases support a variety of export formats, some commercial (e.g. EndNote, Mendeley, etc.) and some standard (e.g. html, plaintext, BibTeX, etc.). The export format should be chosen among those listed in Table 1.<br />
</p>
<p> </p>
<p> </p>
</div>
</div>
<div id="supported-data-sources" class="section level2">
<h2>Supported data sources</h2>
<p><strong><em>Bibliometrix</em></strong> can import bibliographic database files and reference manager files, or can download data through APIs.</p>
<p>Bibliometrix supports bibliographic database files from <em>Web of Science</em>, <em>Scopus</em>, <em>Dimensions</em>, <em>PubMed</em> and <em>Cochrane Library</em>.</p>
<p>In Table 1, we report the complete list of export file formats supported by bibliometrix for each database.</p>
<table>
<caption><strong>Table 1.</strong> <em>Export file formats supported by bibliometrix</em></caption>
<colgroup>
<col width="18%"></col>
<col width="36%"></col>
<col width="28%"></col>
<col width="15%"></col>
</colgroup>
<thead>
<tr class="header">
<th>Source</th>
<th>URL</th>
<th>Format</th>
<th>Extension</th>
</tr>
</thead>
<tbody>
<tr class="odd">
<td>Web of Science</td>
<td><a href="https://www.webofknowledge.com/" class="uri">https://www.webofknowledge.com/</a></td>
<td><ul>
<li>‘BibTeX’</li>
<li>‘plaintext’</li>
<li>‘EndNote Desktop’</li>
</ul></td>
<td><ul>
<li>‘.bib’</li>
<li>‘.txt’</li>
<li>‘.ciw’</li>
</ul></td>
</tr>
<tr class="even">
<td>Scopus</td>
<td><a href="https://www.scopus.com/" class="uri">https://www.scopus.com/</a></td>
<td><ul>
<li>‘BibTeX’</li>
<li>‘CSV export’</li>
</ul></td>
<td><ul>
<li>‘.bib’</li>
<li>‘.csv’</li>
</ul></td>
</tr>
<tr class="odd">
<td>Dimensions</td>
<td><a href="https://app.dimensions.ai/" class="uri">https://app.dimensions.ai/</a></td>
<td><ul>
<li>‘Bibliometric mapping’</li>
<li>‘Excel’</li>
</ul></td>
<td><ul>
<li>‘.csv’</li>
<li>‘.xlsx’</li>
</ul></td>
</tr>
<tr class="even">
<td>PubMed</td>
<td><a href="https://pubmed.ncbi.nlm.nih.gov/" class="uri">https://pubmed.ncbi.nlm.nih.gov/</a></td>
<td><ul>
<li>‘PubMed export file’</li>
</ul></td>
<td><ul>
<li>‘.txt’</li>
</ul></td>
</tr>
<tr class="odd">
<td>Cochrane Library</td>
<td><a href="https://www.cochranelibrary.com/" class="uri">https://www.cochranelibrary.com/</a></td>
<td><ul>
<li>‘plaintext’</li>
</ul></td>
<td><ul>
<li>‘.txt’</li>
</ul></td>
</tr>
</tbody>
</table>
<p>Furthermore, bibliometrix provides support for the APIs (application programming interfaces) of Dimensions, NCBI PubMed and Scopus, using functions from packages dimensionsR (<a href="https://github.com/massimoaria/dimensionsR" class="uri">https://github.com/massimoaria/dimensionsR</a>), pubmedR (<a href="https://github.com/massimoaria/pubmedR" class="uri">https://github.com/massimoaria/pubmedR</a>) and rscopus (<a href="https://github.com/muschellij2/rscopus" class="uri">https://github.com/muschellij2/rscopus</a>), respectively (see Table 2).</p>
<table>
<caption><strong>Table 2.</strong> <em>API data gathering systems supported by bibliometrix</em></caption>
<colgroup>
<col width="16%"></col>
<col width="46%"></col>
<col width="27%"></col>
<col width="9%"></col>
</colgroup>
<thead>
<tr class="header">
<th>Source</th>
<th>API Key request</th>
<th>API Access</th>
<th>Format</th>
</tr>
</thead>
<tbody>
<tr class="odd">
<td>Dimensions</td>
<td>Free for scientometric projects <a href="https://www.dimensions.ai/scientometric-research/" class="uri">https://www.dimensions.ai/scientometric-research/</a></td>
<td>By an account and password<br />
</td>
<td>json</td>
</tr>
<tr class="even">
<td>PubMed</td>
<td>Free<br />
<a href="https://www.ncbi.nlm.nih.gov/account/" class="uri">https://www.ncbi.nlm.nih.gov/account/</a></td>
<td>Without a key (3 requests/s)<br />
With a key (10 requests/s)</td>
<td>xml</td>
</tr>
<tr class="odd">
<td>Scopus</td>
<td>Commercial or Institutional subscription<br />
<a href="https://dev.elsevier.com/" class="uri">https://dev.elsevier.com/</a></td>
<td>By a key (10 requests/s)</td>
<td>xml</td>
</tr>
</tbody>
</table>
<p> </p>
<p> </p>
</div>
<div id="what-types-of-metadata-can-be-exported" class="section level2">
<h2>What types of metadata can be exported</h2>
<p>A bibliographic database stores a very rich set of information about each scientific document, which we call ‘<strong>document metadata</strong>’ or ‘<strong>bibliographic metadata</strong>’.</p>
<p>It is possible to distinguish among five different types of metadata:</p>
<ul>
<li><em>Document info</em> (e.g. publication date, journal title, issue, volume, etc.)</li>
<li><em>Authors info</em> (e.g. authors’ names, affiliations, ORCID, etc.)</li>
<li><em>Content info</em> (e.g. title, abstract, authors’ keywords, etc.)</li>
<li><em>Citation info</em> (e.g. reference lists, number of citations, etc.)</li>
<li><em>Funding info</em> (e.g. funding institutes, acknowledgments, etc.)</li>
</ul>
<p>Web of Science and Scopus allow the user to export the complete set of metadata. This means that it will be possible to perform all analyses implemented in bibliometrix. Some other databases, such as Dimensions, PubMed and Cochrane Library, export just a limited set of metadata types which implies some limitations in the choice of the analyses to carry out.</p>
<p>Table 3 shows which types of metadata each database and file format can export, and therefore which kinds of analyses can be performed.</p>
<table>
<caption><strong>Table 3.</strong> <em>Type of metadata for each file format</em></caption>
<colgroup>
<col width="22%"></col>
<col width="27%"></col>
<col width="50%"></col>
</colgroup>
<thead>
<tr class="header">
<th>Source</th>
<th>Format</th>
<th>Exported metadata</th>
</tr>
</thead>
<tbody>
<tr class="odd">
<td>Web of Science</td>
<td><ul>
<li>‘BibTeX’</li>
<li>‘plaintext’</li>
<li>‘EndNote Desktop’</li>
</ul></td>
<td><ul>
<li>All</li>
<li>All</li>
<li>All</li>
</ul></td>
</tr>
<tr class="even">
<td>Scopus</td>
<td><ul>
<li>‘BibTeX’</li>
<li>‘csv’</li>
</ul></td>
<td><ul>
<li>All</li>
<li>All</li>
</ul></td>
</tr>
<tr class="odd">
<td>Dimensions</td>
<td><ul>
<li>‘Bibl. mapping’</li>
<li>‘Excel’</li>
<li>‘API’</li>
</ul></td>
<td><ul>
<li>Document, Authors, Citation info</li>
<li>Document, Authors, Content info</li>
<li>All but with limited reference info</li>
</ul></td>
</tr>
<tr class="even">
<td>PubMed</td>
<td><ul>
<li>‘PubMed export’</li>
<li>‘API’</li>
</ul></td>
<td><ul>
<li>Document, Authors, Content info</li>
<li>Document, Authors, Content info</li>
</ul></td>
</tr>
<tr class="odd">
<td>Cochrane Library</td>
<td><ul>
<li>‘plaintext’</li>
</ul></td>
<td><ul>
<li>Document, Authors, Content info</li>
</ul></td>
</tr>
</tbody>
</table>
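<p>Citation-based analyses depend on the Citation info row of Table 3. As a minimal sketch, the hypothetical helper below checks whether a converted data frame carries any cited references before such an analysis is attempted. The helper name and the toy data frames are assumptions made for illustration; they are not part of bibliometrix.</p>
<div class="sourceCode"><pre class="sourceCode r"><code class="sourceCode r"># Hypothetical helper: TRUE when the CR (Cited References) column has content.
has_cited_references &lt;- function(M) {
  &quot;CR&quot; %in% names(M) &amp;&amp; any(!is.na(M$CR) &amp; nzchar(M$CR))
}

# Toy data frames standing in for converted collections (invented records).
with_refs    &lt;- data.frame(TI = &quot;PAPER A&quot;, CR = &quot;AUTHOR X, 2001, JOURNAL Y&quot;,
                           stringsAsFactors = FALSE)
without_refs &lt;- data.frame(TI = &quot;PAPER B&quot;, CR = NA_character_,
                           stringsAsFactors = FALSE)

has_cited_references(with_refs)
has_cited_references(without_refs)</code></pre></div>
<p>A check of this kind fails early, instead of producing an empty co-citation network from a source that does not export reference lists.</p>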
<p> </p>
<p> </p>
</div>
<div id="how-to-import-a-set-of-export-data-files" class="section level2">
<h2>How to import a set of export data files</h2>
<p>As already stated, <em>bibliometrix</em> works with several data formats coming from different sources.</p>
<p>Here, we briefly show how to import and convert data from each bibliographic database.</p>
<p>An export file can be read and converted using the function <em>convert2df</em>:</p>
<pre><code>convert2df(file, dbsource, format)</code></pre>
<p>The argument <em>file</em> is a character vector containing the export file names. <em>file</em> can also contain the name of a JSON/XML object downloaded through the Digital Science Dimensions or PubMed APIs (using the packages <em>dimensionsR</em> and <em>pubmedR</em>).</p>
<p>e.g. <code>file &lt;- c(&quot;file1.txt&quot;, &quot;file2.txt&quot;, …)</code></p>
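<p>When a collection is split across several export files, the character vector can be built programmatically. The sketch below is one possible way to do it in base R; the folder name and the two empty placeholder files are hypothetical and only stand in for real downloads.</p>
<div class="sourceCode"><pre class="sourceCode r"><code class="sourceCode r"># Create a throwaway folder with two empty placeholder files
# (hypothetical stand-ins for real export files).
export_dir &lt;- file.path(tempdir(), &quot;wos_exports&quot;)
dir.create(export_dir, showWarnings = FALSE)
file.create(file.path(export_dir, c(&quot;file1.txt&quot;, &quot;file2.txt&quot;)))

# Collect every .txt file in the folder into one character vector.
file &lt;- list.files(export_dir, pattern = &quot;\\.txt$&quot;, full.names = TRUE)
basename(file)

# With real export files, the whole vector is passed in a single call:
# M &lt;- convert2df(file, dbsource = &quot;wos&quot;, format = &quot;plaintext&quot;)</code></pre></div>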
<p><em>convert2df</em> creates a bibliographic data frame in which rows correspond to manuscripts and columns correspond to the Field Tags of the original export file.</p>
<p><em>convert2df</em> accepts two additional arguments: <em>dbsource</em> and <em>format</em>.</p>
<p>The argument <em>dbsource</em> indicates from which database the collection has been downloaded.</p>
<p>It can be:</p>
<ul>
<li><p>“isi” or “wos” (for Clarivate Analytics Web of Science database),</p></li>
<li><p>“scopus” (for SCOPUS database),</p></li>
<li><p>“dimensions” (for DS Dimensions database),</p></li>
<li><p>“pubmed” (for PubMed/Medline database),</p></li>
<li><p>“cochrane” (for Cochrane Library database of systematic reviews).</p></li>
</ul>
<p>The argument <em>format</em> indicates the export file format. It can be one of the formats reported in Table 1.</p>
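<p>Because only some combinations of <em>dbsource</em> and <em>format</em> are valid, it can help to check the pair before calling <em>convert2df</em>. The lookup below is a sketch transcribed from Table 1 and from the format strings used in the import examples of this document; <code>check_format</code> is a hypothetical helper, not a bibliometrix function, and it does not cover data downloaded through APIs.</p>
<div class="sourceCode"><pre class="sourceCode r"><code class="sourceCode r"># File formats per source, transcribed from Table 1 (API downloads excluded).
supported_formats &lt;- list(
  wos        = c(&quot;bibtex&quot;, &quot;plaintext&quot;, &quot;endnote&quot;),
  scopus     = c(&quot;bibtex&quot;, &quot;csv&quot;),
  dimensions = c(&quot;csv&quot;, &quot;excel&quot;),
  pubmed     = c(&quot;pubmed&quot;),
  cochrane   = c(&quot;plaintext&quot;)
)

# Hypothetical helper: stop early on an unsupported combination.
check_format &lt;- function(dbsource, format) {
  if (dbsource == &quot;isi&quot;) dbsource &lt;- &quot;wos&quot;  # &quot;isi&quot; and &quot;wos&quot; name the same source
  ok &lt;- format %in% supported_formats[[dbsource]]
  if (!ok) stop(&quot;Unsupported format '&quot;, format, &quot;' for source '&quot;, dbsource, &quot;'&quot;)
  invisible(TRUE)
}

check_format(&quot;scopus&quot;, &quot;csv&quot;)</code></pre></div>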
<p>Each manuscript has several bibliographic attributes, such as authors’ names, title, keywords, and other information. Together, these attributes are called the metadata of the document.</p>
<p>Data frame columns are named using the standard Clarivate Analytics WoS Field Tag coding.</p>
<p>The main field tags are:</p>
<table>
<thead>
<tr class="header">
<th>Field Tag</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr class="odd">
<td>AU</td>
<td>Authors’ Names</td>
</tr>
<tr class="even">
<td>TI</td>
<td>Document Title</td>
</tr>
<tr class="odd">
<td>SO</td>
<td>Journal Name (or Source)</td>
</tr>
<tr class="even">
<td>JI</td>
<td>ISO Source Abbreviation</td>
</tr>
<tr class="odd">
<td>DT</td>
<td>Document Type</td>
</tr>
<tr class="even">
<td>DE</td>
<td>Authors’ Keywords</td>
</tr>
<tr class="odd">
<td>ID</td>
<td>Keywords associated by SCOPUS or WoS database</td>
</tr>
<tr class="even">
<td>AB</td>
<td>Abstract</td>
</tr>
<tr class="odd">
<td>C1</td>
<td>Authors’ Affiliations</td>
</tr>
<tr class="even">
<td>RP</td>
<td>Corresponding Author’s Affiliation</td>
</tr>
<tr class="odd">
<td>CR</td>
<td>Cited References</td>
</tr>
<tr class="even">
<td>TC</td>
<td>Times Cited</td>
</tr>
<tr class="odd">
<td>PY</td>
<td>Publication Year</td>
</tr>
<tr class="even">
<td>SC</td>
<td>Subject Category</td>
</tr>
<tr class="odd">
<td>UT</td>
<td>Unique Article Identifier</td>
</tr>
<tr class="even">
<td>DB</td>
<td>Bibliographic Database</td>
</tr>
</tbody>
</table>
<p>For a complete list of field tags, see <a href="https://www.bibliometrix.org/documents/Field_Tags_bibliometrix.pdf" class="uri">https://www.bibliometrix.org/documents/Field_Tags_bibliometrix.pdf</a></p>
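<p>Since the field tags are ordinary column names, the converted data frame can be inspected with base R. The sketch below uses a small invented data frame rather than real records, and it assumes that multiple authors in <code>AU</code> are separated by a semicolon; check this against your own data before relying on it.</p>
<div class="sourceCode"><pre class="sourceCode r"><code class="sourceCode r"># Toy data frame using some of the field tags above (invented records).
M &lt;- data.frame(
  AU = c(&quot;AUTHOR A;AUTHOR B&quot;, &quot;AUTHOR C&quot;),
  TI = c(&quot;PAPER ONE&quot;, &quot;PAPER TWO&quot;),
  PY = c(2017, 1955),
  TC = c(120, 300),
  stringsAsFactors = FALSE
)

# Assumption: authors in AU are separated by &quot;;&quot;.
authors_per_doc &lt;- lengths(strsplit(M$AU, &quot;;&quot;, fixed = TRUE))

# Documents sorted by times cited, most cited first.
ranked &lt;- M[order(M$TC, decreasing = TRUE), c(&quot;TI&quot;, &quot;PY&quot;, &quot;TC&quot;)]
print(ranked)</code></pre></div>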
<p> </p>
<div id="how-to-import-data-from-web-of-science" class="section level3">
<h3>How to import data from Web of Science</h3>
<p><em>Clarivate Analytics Web of Science (WoS)</em> (<a href="https://www.webofknowledge.com" class="uri">https://www.webofknowledge.com</a>) is one of the world’s most trusted publisher-independent global citation databases. It was founded by Eugene Garfield, one of the pioneers of bibliometrics. This platform includes many different collections and allows users to export data in several different file formats.</p>
<p><em>bibliometrix</em> supports three WoS export file formats: BibTeX, plaintext and EndNote Desktop.</p>
<p>To access Web of Science services, a subscription is required. When downloading data from Web of Science, make sure that the Web of Science Core Collection database is selected. Choose the <strong>Export option</strong> followed by <strong>EndNote Desktop</strong> file format, or <strong>Other File Formats</strong> option, and choose either the <strong>Plain Text</strong> or the <strong>BibTeX</strong> file format. Although bibliometrix supports all these file formats, we recommend the use of the plaintext format. When asked which data elements to download, choose the <strong>Full Record and Cited References option</strong>. Downloading cited reference data is necessary for identifying citation, bibliographic coupling, and co-citation links between items.</p>
<p><strong>WoS BibTeX file format</strong></p>
<div class="sourceCode" id="cb3"><pre class="sourceCode r"><code class="sourceCode r"><span id="cb3-1"><a href="#cb3-1"></a><span class="kw">library</span>(bibliometrix)</span>
<span id="cb3-2"><a href="#cb3-2"></a></span>
<span id="cb3-3"><a href="#cb3-3"></a>file &lt;-<span class="st"> &quot;https://www.bibliometrix.org/datasets/wos_bibtex.bib&quot;</span></span>
<span id="cb3-4"><a href="#cb3-4"></a></span>
<span id="cb3-5"><a href="#cb3-5"></a>M &lt;-<span class="st"> </span><span class="kw">convert2df</span>(file, <span class="dt">dbsource =</span> <span class="st">&quot;wos&quot;</span>, <span class="dt">format =</span> <span class="st">&quot;bibtex&quot;</span>)</span>
<span id="cb3-6"><a href="#cb3-6"></a></span>
<span id="cb3-7"><a href="#cb3-7"></a><span class="kw">head</span>(M[<span class="st">&quot;TC&quot;</span>])</span></code></pre></div>
<p><strong>WoS plaintext file format</strong></p>
<div class="sourceCode" id="cb4"><pre class="sourceCode r"><code class="sourceCode r"><span id="cb4-1"><a href="#cb4-1"></a>file &lt;-<span class="st"> &quot;https://www.bibliometrix.org/datasets/wos_plaintext.txt&quot;</span></span>
<span id="cb4-2"><a href="#cb4-2"></a></span>
<span id="cb4-3"><a href="#cb4-3"></a>M &lt;-<span class="st"> </span><span class="kw">convert2df</span>(file, <span class="dt">dbsource =</span> <span class="st">&quot;wos&quot;</span>, <span class="dt">format =</span> <span class="st">&quot;plaintext&quot;</span>)</span>
<span id="cb4-4"><a href="#cb4-4"></a></span>
<span id="cb4-5"><a href="#cb4-5"></a><span class="kw">head</span>(M[<span class="st">&quot;TC&quot;</span>])</span></code></pre></div>
<p><strong>WoS EndNote Desktop file format</strong></p>
<div class="sourceCode" id="cb5"><pre class="sourceCode r"><code class="sourceCode r"><span id="cb5-1"><a href="#cb5-1"></a>file &lt;-<span class="st"> &quot;https://www.bibliometrix.org/datasets/wos_plaintext.ciw&quot;</span></span>
<span id="cb5-2"><a href="#cb5-2"></a></span>
<span id="cb5-3"><a href="#cb5-3"></a>M &lt;-<span class="st"> </span><span class="kw">convert2df</span>(file, <span class="dt">dbsource =</span> <span class="st">&quot;wos&quot;</span>, <span class="dt">format =</span> <span class="st">&quot;endnote&quot;</span>)</span>
<span id="cb5-4"><a href="#cb5-4"></a></span>
<span id="cb5-5"><a href="#cb5-5"></a><span class="kw">head</span>(M[<span class="st">&quot;TC&quot;</span>])</span></code></pre></div>
<p> </p>
</div>
<div id="how-to-import-data-from-scopus" class="section level3">
<h3>How to import data from Scopus</h3>
<p><em>Scopus</em> (<a href="https://www.scopus.com/" class="uri">https://www.scopus.com/</a>) is one of the largest abstract and citation databases of peer-reviewed literature: scientific journals, books, and conference proceedings. Like WoS, Scopus allows users to export data in several different file formats.</p>
<p><em>bibliometrix</em> supports two Scopus export file formats: BibTeX and csv (comma-separated values).</p>
<p>To access Scopus services, a subscription is required. To download data from Scopus, choose the <strong>CSV</strong> or the <strong>BibTeX export</strong> option. (Do not choose the Download option!) Make sure that the data is downloaded in a ‘.csv’ or ‘.bib’ file and that all data elements are included. Although bibliometrix supports both file formats, we recommend the use of the CSV format.</p>
<p><strong>Scopus csv file format</strong></p>
<div class="sourceCode" id="cb6"><pre class="sourceCode r"><code class="sourceCode r"><span id="cb6-1"><a href="#cb6-1"></a><span class="kw">library</span>(bibliometrix)</span>
<span id="cb6-2"><a href="#cb6-2"></a></span>
<span id="cb6-3"><a href="#cb6-3"></a>file &lt;-<span class="st"> &quot;https://www.bibliometrix.org/datasets/scopus_csv.csv&quot;</span></span>
<span id="cb6-4"><a href="#cb6-4"></a></span>
<span id="cb6-5"><a href="#cb6-5"></a>M &lt;-<span class="st"> </span><span class="kw">convert2df</span>(file, <span class="dt">dbsource =</span> <span class="st">&quot;scopus&quot;</span>, <span class="dt">format =</span> <span class="st">&quot;csv&quot;</span>)</span>
<span id="cb6-6"><a href="#cb6-6"></a></span>
<span id="cb6-7"><a href="#cb6-7"></a><span class="kw">head</span>(M[<span class="st">&quot;TC&quot;</span>])</span></code></pre></div>
<p><strong>Scopus BibTeX file format</strong></p>
<div class="sourceCode" id="cb7"><pre class="sourceCode r"><code class="sourceCode r"><span id="cb7-1"><a href="#cb7-1"></a><span class="kw">library</span>(bibliometrix)</span>
<span id="cb7-2"><a href="#cb7-2"></a></span>
<span id="cb7-3"><a href="#cb7-3"></a>file &lt;-<span class="st"> &quot;https://www.bibliometrix.org/datasets/scopus_bibtex.bib&quot;</span></span>
<span id="cb7-4"><a href="#cb7-4"></a></span>
<span id="cb7-5"><a href="#cb7-5"></a>M &lt;-<span class="st"> </span><span class="kw">convert2df</span>(file, <span class="dt">dbsource =</span> <span class="st">&quot;scopus&quot;</span>, <span class="dt">format =</span> <span class="st">&quot;bibtex&quot;</span>)</span>
<span id="cb7-6"><a href="#cb7-6"></a></span>
<span id="cb7-7"><a href="#cb7-7"></a><span class="kw">head</span>(M[<span class="st">&quot;TC&quot;</span>])</span></code></pre></div>
<p> </p>
</div>
<div id="how-to-import-data-from-dimensions" class="section level3">
<h3>How to import data from Dimensions</h3>
<p><em>Dimensions</em> is a comprehensive database by Digital Science that does not impose limitations on users. It is the only database that links publications and citations with grants, patents, clinical trials, datasets, and policy papers to deliver a more holistic view of the research landscape. Unlike WoS and Scopus, Dimensions allows exporting data in only two file formats: Excel and csv.</p>
<p>These two formats are not interchangeable, because they contain different types of metadata. In particular, the Excel file contains metadata about document contents, such as abstract, keywords, etc., whereas the csv file contains reference lists and affiliation names that can be used in citation and co-citation analyses.</p>
<p><em>bibliometrix</em> supports both Dimensions export file formats: Excel and csv (comma-separated values).</p>
<p>The free version of Dimensions, for which no subscription is needed, can be used. A user account is required. To download data from Dimensions, choose the Save / Export option, followed by the <strong>Export for bibliometric mapping or Excel option</strong>.</p>
<p><strong>Dimensions Excel file format</strong></p>
<div class="sourceCode" id="cb8"><pre class="sourceCode r"><code class="sourceCode r"><span id="cb8-1"><a href="#cb8-1"></a>file &lt;-<span class="st"> &quot;https://www.bibliometrix.org/datasets/dimensions_excel.xlsx&quot;</span></span>
<span id="cb8-2"><a href="#cb8-2"></a></span>
<span id="cb8-3"><a href="#cb8-3"></a>M &lt;-<span class="st"> </span><span class="kw">convert2df</span>(file, <span class="dt">dbsource =</span> <span class="st">&quot;dimensions&quot;</span>, <span class="dt">format =</span> <span class="st">&quot;excel&quot;</span>)</span>
<span id="cb8-4"><a href="#cb8-4"></a></span>
<span id="cb8-5"><a href="#cb8-5"></a><span class="kw">head</span>(M[<span class="st">&quot;TC&quot;</span>])</span></code></pre></div>
<p><strong>Dimensions csv file format</strong></p>
<div class="sourceCode" id="cb9"><pre class="sourceCode r"><code class="sourceCode r"><span id="cb9-1"><a href="#cb9-1"></a>file &lt;-<span class="st"> &quot;https://www.bibliometrix.org/datasets/dimensions_csv.csv&quot;</span></span>
<span id="cb9-2"><a href="#cb9-2"></a></span>
<span id="cb9-3"><a href="#cb9-3"></a>M &lt;-<span class="st"> </span><span class="kw">convert2df</span>(file, <span class="dt">dbsource =</span> <span class="st">&quot;dimensions&quot;</span>, <span class="dt">format =</span> <span class="st">&quot;csv&quot;</span>)</span>
<span id="cb9-4"><a href="#cb9-4"></a></span>
<span id="cb9-5"><a href="#cb9-5"></a><span class="kw">head</span>(M[<span class="st">&quot;TC&quot;</span>])</span></code></pre></div>
<p> </p>
</div>
<div id="how-to-import-data-from-pubmed" class="section level3">
<h3>How to import data from PubMed</h3>
<p><em>PubMed</em> comprises more than 30 million citations for biomedical literature from MEDLINE, life science journals, and online books. Citations may include links to full-text content from PubMed Central and publisher web sites.</p>
<p>PubMed allows users to export data in many different file formats, but the most suitable for science mapping analysis is the “pubmed txt” format. However, none of these file formats contains metadata about citations and reference lists. This means that it is not possible to perform citation and co-citation analysis using PubMed data.</p>
<p><em>bibliometrix</em> supports only the “pubmed txt” file format.</p>
<p>PubMed can be accessed at <a href="https://pubmed.ncbi.nlm.nih.gov/" class="uri">https://pubmed.ncbi.nlm.nih.gov/</a>. To download data from PubMed, choose the <strong>Save option</strong>, choose <strong>All results</strong> as content selection, and choose <strong>PubMed</strong> as the file format. Data downloaded from PubMed cannot be used for identifying citation, bibliographic coupling, and co-citation links between items.</p>
<p><strong>PubMed txt file format</strong></p>
<div class="sourceCode" id="cb10"><pre class="sourceCode r"><code class="sourceCode r"><span id="cb10-1"><a href="#cb10-1"></a>file &lt;-<span class="st"> &quot;https://www.bibliometrix.org/datasets/pubmed_txt.txt&quot;</span></span>
<span id="cb10-2"><a href="#cb10-2"></a></span>
<span id="cb10-3"><a href="#cb10-3"></a>M &lt;-<span class="st"> </span><span class="kw">convert2df</span>(file, <span class="dt">dbsource =</span> <span class="st">&quot;pubmed&quot;</span>, <span class="dt">format =</span> <span class="st">&quot;pubmed&quot;</span>)</span>
<span id="cb10-4"><a href="#cb10-4"></a></span>
<span id="cb10-5"><a href="#cb10-5"></a><span class="kw">head</span>(M[<span class="st">&quot;TC&quot;</span>])</span></code></pre></div>
<p> </p>
</div>
<div id="how-to-import-data-from-cochrane-library" class="section level3">
<h3>How to import data from Cochrane Library</h3>
<p>The <em>Cochrane Database of Systematic Reviews (CDSR)</em>, owned by Cochrane Library, is a leading journal and database for systematic reviews in health care. CDSR includes Cochrane Reviews (systematic reviews) and protocols for Cochrane Reviews as well as editorials and supplements.</p>
<p>As for PubMed, CDSR allows exporting data in many file formats, but none of them contains metadata about citations and reference lists.</p>
<p><em>bibliometrix</em> supports the “plaintext” file format.</p>
<p>CDSR can be accessed at <a href="https://www.cochranelibrary.com/search" class="uri">https://www.cochranelibrary.com/search</a>. To download data from CDSR, choose the <strong>Export selected citation(s) option</strong>, and choose <strong>Plain Text</strong> as the file format. Data downloaded from CDSR cannot be used for identifying citation, bibliographic coupling, and co-citation links between items.</p>
<p><strong>Cochrane plaintext file format</strong></p>
<div class="sourceCode" id="cb11"><pre class="sourceCode r"><code class="sourceCode r"><span id="cb11-1"><a href="#cb11-1"></a>file &lt;-<span class="st"> &quot;https://www.bibliometrix.org/datasets/cochrane_plaintext.txt&quot;</span></span>
<span id="cb11-2"><a href="#cb11-2"></a></span>
<span id="cb11-3"><a href="#cb11-3"></a>M &lt;-<span class="st"> </span><span class="kw">convert2df</span>(file, <span class="dt">dbsource =</span> <span class="st">&quot;cochrane&quot;</span>, <span class="dt">format =</span> <span class="st">&quot;plaintext&quot;</span>)</span>
<span id="cb11-4"><a href="#cb11-4"></a></span>
<span id="cb11-5"><a href="#cb11-5"></a><span class="kw">head</span>(M[<span class="st">&quot;TC&quot;</span>])</span></code></pre></div>
<p> </p>
<p> </p>
</div>
</div>
<div id="how-to-download-bibliographic-data-using-apis" class="section level2">
<h2>How to download bibliographic data using APIs</h2>
<p><em>bibliometrix</em> can download data through an API. The APIs supported by bibliometrix are listed in Table 2. The use of APIs requires an internet connection. bibliometrix will download data for all documents that match the specified search criteria defined by a query.</p>
<p>The Dimensions API requires an account in order to obtain a valid token for querying the database. An account can be obtained for free for scientometric research projects by applying at <a href="https://www.dimensions.ai/scientometric-research/" class="uri">https://www.dimensions.ai/scientometric-research/</a>. The Scopus API requires an API key. Full access to the Scopus APIs is granted only to users with a Scopus subscription (such as an academic subscription). These users can request an API key at <a href="https://dev.elsevier.com/" class="uri">https://dev.elsevier.com/</a>. Finally, access to the PubMed API is free by default and does not require an API key. Without a key, PubMed limits users to 3 requests per second. Users who register for an API key can make up to ten requests per second.</p>
<p>Obtaining a PubMed API key is simple: register for a “My NCBI account” (<a href="https://www.ncbi.nlm.nih.gov/account/" class="uri">https://www.ncbi.nlm.nih.gov/account/</a>), then click the button on the “account settings page” (<a href="https://www.ncbi.nlm.nih.gov/account/settings/" class="uri">https://www.ncbi.nlm.nih.gov/account/settings/</a>).</p>
<p>At the moment, an important limitation of the Dimensions and PubMed APIs is that data downloaded through these APIs cannot be used for identifying citation, bibliographic coupling, and co-citation links between items.</p>
<p> </p>
<div id="how-to-import-data-using-dimensions-api" class="section level3">
<h3>How to import data using Dimensions API</h3>
<p>The <em>Dimensions API</em> provides direct access to Dimensions data and makes it possible to retrieve results for precise and complex queries. Queries are performed using the <em>dimensionsR R-package</em> functions, which implement the Dimensions Search Language (DSL), a custom query language created by Dimensions.</p>
<p>The user can either write a valid query in that language or, alternatively, build one with the function <em>dsQueryBuild</em>.</p>
<p>For more details on using the dimensionsR package, see the package vignette at <a href="https://cran.r-project.org/package=dimensionsR/vignettes/A_Brief_Example.html" class="uri">https://cran.r-project.org/package=dimensionsR/vignettes/A_Brief_Example.html</a>.</p>
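<p>As a minimal sketch of this workflow, a download through the Dimensions API might look like the following. It assumes that the <em>dimensionsR</em> functions <em>dsAuth</em>, <em>dsQueryBuild</em>, <em>dsApiRequest</em>, and <em>dsApi2df</em> behave as described in the package vignette. The credentials and search terms are placeholders and the argument values are only illustrative, so check the package documentation for the exact signatures in your installed version.</p>
<div class="sourceCode"><pre class="sourceCode r"><code class="sourceCode r">library(dimensionsR)

# Placeholder credentials (assumption): replace with your own account details
token &lt;- dsAuth(username = &quot;your_username&quot;, password = &quot;your_password&quot;)

# Build a DSL query without writing it by hand (illustrative search terms)
query &lt;- dsQueryBuild(item = &quot;publications&quot;,
                      words = &quot;bibliometric*&quot;,
                      type = &quot;article&quot;,
                      start_year = 2010, end_year = 2020,
                      output_fields = c(&quot;basics&quot;, &quot;extras&quot;, &quot;authors&quot;, &quot;concepts&quot;))

# First ask only for the number of matching documents (limit = 0)
res &lt;- dsApiRequest(token = token, query = query, limit = 0)
res$total_count

# Then download the matching records and convert them to a data frame
D &lt;- dsApiRequest(token = token, query = query, step = 200, limit = res$total_count)
M &lt;- dsApi2df(D)</code></pre></div>
<p>Requesting the total count first makes it possible to judge the size of the collection before starting a long download. The downloaded object can presumably also be passed to <em>convert2df</em> with <em>dbsource = "dimensions"</em> and <em>format = "api"</em>, but verify this against the <em>bibliometrix</em> help page for your version.</p>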
<p> </p>
</div>
<div id="how-to-import-data-using-pubmed-api" class="section level3">
<h3>How to import data using PubMed API</h3>
<p>The <em>PubMed API</em> provides direct access to PubMed data by submitting a query written in the PubMed query language. The PubMed data can be gathered using the <em>pubmedR R-package</em> functions.</p>
<p>For more details on using the pubmedR package, see the package vignette at <a href="https://cran.r-project.org/package=pubmedR/vignettes/A_Brief_Example.html" class="uri">https://cran.r-project.org/package=pubmedR/vignettes/A_Brief_Example.html</a>.</p>
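<p>A comparable sketch for PubMed is shown here. It assumes that the <em>pubmedR</em> functions <em>pmQueryTotalCount</em>, <em>pmApiRequest</em>, and <em>pmApi2df</em> behave as described in the package vignette. The query string is only an illustrative example, and <em>api_key</em> is left as <em>NULL</em> to use the free access tier without a key.</p>
<div class="sourceCode"><pre class="sourceCode r"><code class="sourceCode r">library(pubmedR)

# NULL uses keyless access; supply your own NCBI key for a higher request rate
api_key &lt;- NULL

# Illustrative query written in the PubMed query language
query &lt;- &quot;bibliometric*[Title/Abstract] AND english[LA] AND Journal Article[PT] AND 2000:2020[DP]&quot;

# Check how many documents match before downloading them
res &lt;- pmQueryTotalCount(query = query, api_key = api_key)
res$total_count

# Download the matching records and convert them to a data frame
D &lt;- pmApiRequest(query = query, limit = res$total_count, api_key = api_key)
M &lt;- pmApi2df(D)</code></pre></div>
<p>As with the file-based PubMed import, the resulting data frame carries no cited-reference metadata, so it is suited to descriptive and co-word analyses rather than citation-based ones.</p>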
<p> </p>
</div>
</div>



<!-- code folding -->


<!-- dynamically load mathjax for compatibility with self-contained -->
<script>
  (function () {
    var script = document.createElement("script");
    script.type = "text/javascript";
    script.src  = "https://mathjax.rstudio.com/latest/MathJax.js?config=TeX-AMS-MML_HTMLorMML";
    document.getElementsByTagName("head")[0].appendChild(script);
  })();
</script>

</body>
</html>
